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Abstract 

An algorithm is given to factor an integer with iV digits in ln m N steps, with m 
approximately 4 or 5. Textbook quadratic sieve methods are exponentially slower. 
An improvement with the aid of an a particular function would provide a further 
exponential speedup. 



Factorization of large integers is important to many areas of pure mathematics 
and has practical applications in applied math including cryptography. This subject 
has been under intense study for many years pQ; improvements in the methodology 
are especially desired for computational reasons. 

Given an integer N composed of approximately ln 10 N digits, standard text- 
book quadratic sieve methods generate the factorization of the number into primes 
in roughly 

e av/lniVlnlniV 

moves. The steps require manipulations of large integers, of the size N, with bit 
complexity of approximate In N. The number a is approximately 2, depending on the 
variant used pp. 

The presentation in this work generates a computational method to obtain the 
prime factorization in 

ln m N (2) 

moves with integers of the same size. The factor m is specified by the convergence of 
the solution to a set of polynomial equations in In N variables, which numerically is 
approximately m = 3, after the root selection is chosen from small numbers to large 
(see, e.g. 0). 

Given a function CV that counts the number of prime factors of a number, i.e. 

N=f[p k J U) C N = ±k,, (3) 

j=l i=l 

the factorization of the number iV could be performed in approximately C™ steps. 
The bound on the number of prime factors of an integer N is set by hi2 N, the product 
of the smallest prime number 2. The number of primes smaller than a number N is 
approximately N/ InN, and the Cn is roughly In In N. Hence, given the function Cn 
a further exponential improvement is generically given. However, this function would 
drastically simplify the factorization of large numbers possessing only a few prime 
factors. The upper bound of Cn ~ In N describes the case discussed in the previous 
paragraph. 

1 



(1) 



Consider a number with exactly Cn factors. This number projects in base x onto 
the form, 

Cjy 

N = J2a l x t . (4) 

i=0 

The polynomial form in (jlj) admits a product form, 

N = l[(c j x-b j ) , (5) 

i=i 

with CjX — bj integral. The number scales into the form, 

Cjv 

N = 1 U(a j x-l) , (6) 

i=l 

in which there are Cn numbers ctj and a number 7. The same integer has the prime 
factorization 

Cjv 

N = Up*m, (7) 

«=i 

with the set a(j) containing possible redundancy, for example, p a m = p a <2) = 2. 
Given an integer base x the solution to the numbers bj generate the prime factors, as 
long as the value is correct. 

Two examples are given. First, 15 = 3 2 + 2(3) = x(x + 2), which solves for 
the prime factors 3 and 5. Second, 10 = 2 3 + 2 = x(x 2 + 1), which solves for the 
prime factors 2 and 5. In the second example, even though there are two factors, the 
polynomial is a cubic with a vanishing zeroth order term; the origin of the cubic is 
that there is a complex root. 

The polynomial base form of the number, i.e. iV = J^o-i^ 1 , will not in all cases 
factor into the form (jHJ with real coeffcients Cj and bj. However, because the coeffi- 
cients are real, the roots will enter in complex conjugate pairs. The product of these 
complex conjugates form a positive number. In order to test all possible cases, in- 
cluding the presence of a factor being represented as a product of two complex roots, 
all numbers from Cn to 2Cjv should be examined. Potential complex roots come in 
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pairs, and the maximum number of factors could take on the form of Cn products of 
two complex numbers. The cost of the additional complexity is of order unity. 

The expansion of © generates a set of algebraic equations relating the integer 
coefficients bj to those in aj. The form is, 

7«ia 2 • • • a CN = a CN 

7 = «o , (8) 

in which the combinations 



Cj x - bj (9) 

must converge to integers or into pairs with the product being an integer, and for 
the maximal factorization to prime numbers p (of which there are an approximate 
N/lnN of them for the number N). The determination of the numbers <Xj must 
be rational as (cj/bjX — 1) = n/bj, with n integral. The other case of interest is 
when Cj/bj is complex and the relevant condition is \{cj/bjX — 1)| 2 = n 2 /6 2 ; this is 
not satisified in general by complex rational numbers. However, one may square the 
number iV and then all terms in the product must be rational. 

The number 7 must be an integer or the square of 7 must be an integer, ac- 
cording to the presence of complex terms (roots) which square to an integer. If the 
solution does not satisfy these criteria, then there is not a valid factorization iV into 
integers. Given rational solutions atj = Cj/bj and the 7 = JJbj, the straightforward 
multiplication of 7 into the Cn factors generates the factorization into N1N2 . . . Nc N , 
via eliminating the denominators in the individual terms of the rational numbers. 
The complex root case allows the numbers to be determined as Nj = Nj >+ Nj . 

Solving these equations generates the prime factorization of the integer A" into 
the set of primes p a (j)- Numerically, solving a set of equations in n variables typically 
has convergence of n 3 if the initial starting values are chosen correctly. 

In the case of Cn not known, but bounded by In A/", all cases of interest from 
the test cases of Cn = 1 to Cn = \nN may be examined, at the cost of duplicating 
the process by the bound In AT. Typical true values of Cn are expected for generic 
numbers to be smaller than the bound, e.g. In In A/". The cases from Cn to 2Cn must 
also be examined in order to take into account the pairs of complex conjugates. 

Computationally exploring all of the cases from Cn = 1 to Cn = 21nA^ (e.g. 
~ Cn) for integer bases x finds all product forms of the integer A^ into products 
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N = N t N 2 N = N ± N 2 N 3 (10) 



N = N 1 N 2 ...N Cn = YIp <tU) ■ 



(11) 



3=1 



Solving for bj and Cj (e.g. aj = bj/cj) in terms of aj generates either integral values 
or non-integral values, or pairs or complex conjugates. In the case of integral values 
for all the bj parameters, the base x is resubstituted into the factors CjX — bj of the 
total product, 



to find the values of the individual N n . The integral solutions generate the various 
factorizations. In the case of complex conjugate pairs, the integrality is tested by 
multiplying the individual terms, 



and determining if it is an integral (the latter has to be rational). The maximum Cn 
that results in integral values of CjX — bj gives the prime factorization. Non-integral 
solutions to CjX — bj do not generate integer factorization of the number N into Cn 
numbers. 

In the computation for the roots, the parameters aij and 7 are determined. The 
integrality of the CjX — bj translates into Oij being a rational number (when not a 
complex root allowing the complex CjX — bj square to be an integer). The rationality 
allows the denominators of all the atj parameters to be extracted and used to multiply 
the prefactor 7. The case of the complex roots may be examined also by first taking 
the complex square and examining for integrality; the denominators must also be 
taken out of the products. 

Computationally testing all cases from Cn = 1 to IniV finds all product forms 
of the potentially large integer N. The factorization process entails three steps: 1) 
projecting the number iV into base x, 2) solving the system of algebraic equations 
for integral CjX — bj, 3) substituting the base x back into the CjX — bj for factor 
determination. 




(12) 




(13) 
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Computationally the first step requires specifying the base x and projecting the 
number onto it. The base x is specified by the two equations, 



In N , In N 

mx > — lax < — — . (14) 

Cn + 1 Cn 

Following the determination of x, the coefficients aj are determined via starting with 
Q>c N — x ! J = Cn, Ng N = N and following the procedure, 



•V, , -V, - a r r' (15) 

with 

7i _! = Nj-i/x* -1 . (16) 

Take a_,-_i = [7^-1] with a rounding down of 7j_i; if > x then the procedure stops 
and the remaining a$ are set to = with i < j — 1. Otherwise, the subtraction 
process continues. The procedure costs at most 3Cn operations with numbers of at 
most size N (bit size In TV). Due to the bound on C N , this process is at most of the 
size 3 In N operations, one of which is division. 

The next step requires the solution of the algebraic equations for bj in terms of aj. 
There are Cn + 1 equations in Cn + 1 variables. The initial roots are chosen from the 
lowest prime neighborhood around p = CjX — bj, to larger. This procedure is natural 
for the root determinations in the case of the exact Cn- Another case is in choosing 
cascades in the range 10 n to 10 n+1 for n < ln 10 N. Convergence is not analyzed, but 
for well chosen starting values, the number of iterations is typically N%; this is C% for 
the case of Cn variables and equations. The bound is In 3 N. Roughly, if the number 
of operations per iteration is C N (i.e. evaluating a set of similar polynomials) and 
there are Cn roots, then the steps would number as C N . 

If any roots converge to a non-integral value of bj, then the integer iV does not fac- 
tor into Cn numbers. This shortens the number of iterations and steps. The process 
of determining the factorization of iV into the products of two to Cn numbers re- 
quires ln m iV steps, with m denoting an average value from the root selection process, 
the number of variables at each step, the root solving and iteration process including 
shortcuts such as information from lower Cn examples, and an averaging of the short- 
ening the algorithm during the process of lower unknown roots or non-integerness. 
Perhaps, the average results in m ~ 4 or 5, less than In 6 N. 
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To compare with the textbook quadratic sieve method, take the logarithm of the 
steps for both this method and the former, 



aVlniVlnlniV 



m In In N 



(17) 



which is, 



a 2 lniVlnln7V 



m 2 In 2 In N . 



(18) 



The gain is clearly an exponential. The terms compare as a 2 In N = m 2 In In N' and 
N' = exp exp(a 2 / m 2 N) . Consider N = 10 1000 : the numbers are an approximate 



In addition to prime factorization, the product form of the integer into various 
products of factors is determined; this is an additional byproduct of the procedure 
and its computational cost. Furthermore, an explicit knowlege of CV, the number of 
prime factors of a number, would provide a further exponential speedup. 

The procedure here may be adapted to find various forms of number decomposi- 
tions. An example is to find the form of a number written as a sum of products of 
primes. 



exp (a 2 12000) ^ 2 vs. exp(m6) 



1/2 
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Abstract 

An algorithm is given that generates the prime factorization of a number N in 
potentially In 2 N moves, with the complexity limited by the base determination of 
a number in a small degree polynomial. This fact could be a consequence of the 
Riemann hypothesis being true. This work addends previous work on prime factor- 
izations, involving an LU (or QR) factorization. The modification to the previous 
methodology involves a special number on the base reduction of the number N into 
base x. 



1 Introduction 



Factorization of large integers is important to many areas of pure mathematics and 
has practical applications in applied math including cryptography. This subject has 
been under intense study for many years lj; improvements in the methodology are 
especially desired for computational reasons. 

Previous work by the author allowed an estimate of ln m N with m equal to 5 or 
6 due to computational reaons However, a condition has appeared on one of the 
variables that ultimately forces the prime factors to be found in In 2 N steps, which is 
explained here. The greater Riemann hypothesis implies that the factorization could 
be performed in In 4 N moves. 

The case of a single number factorizable into only two prime numbers is a special 
case. The factorization may be performed it seems in In 2 N moves, given some further 
information about the base expansion N = J2 hx l of a number N with a particular 
remainder b . This is remarkable given the currently known bounds on factoring the 
two- channel keys. 

The factorization of a number into m prime factors proceeds first by writing the 
number in a base x format [2j, 

n m 

N = Y,b l x l = 1 l[(a t x + l) , (1.1) 

i=0 i=0 

with n ranging from m to 2m, with the case of n = m being the simplest. 

The numbers b{ are converted into the on to find the factors. In the case of the 
prime factorization the number m labels the maximal number of prime factors, in the 
case that 7 itself is not prime; this is the generic case as discussed in the following. 
Solving the system of equations in (jl.lj) for the on in terms of the b{ results in the 
factorization of the number N in terms of the factors {o>iX + 1) and 7 [2]. 

The case in which all a, are real numbers, and not complex, is considered. The 
case in which the the aij are complex is similarly analyzed with small changes in 
the algorithm. The solutions to the complex numbers a« come in pairs, with the 
(ajX + l)(a\x + 1) generating the prime factors [2j. 

The number 60 — 7 is a special number. If it is not prime itself, then it must be 
a factor of 10 a , that is, a power of our base ten number system. Consider the factors 



OliX + 1 , 
1 



(1.2) 



with x the base number used in the factoring of (jl.lj) and with a« solved for. For m 
the maximum number, this number must be prime, which means that the number is 
a potential decimal 

a iX + 1 = Ni/Di , (1.3) 

such as 2/10 or 7/100. Because iV, is prime, it does not divide into D^, which means 
that Di must be a multiple 10 a \ In general these numbers Ni/Di are decimals, 
which means that the denominator is a power of the base 10. Collecting all of the 
denominators 10 ai results in a net factor, 

rn 

n io ai = io a , (i.4) 
i=i 

and this number must cancel the 7 factor, resulting in either a prime number or an 
integer. If the latter, then the factor of 10 a must cancel the 7 number resulting in 
unity. 

Thus we arrive at the result: In the case that m is the maximal integer and not 
all cx.i are fractions, &o is a number 10 a . This means that one must only test the base 
decompositions 

m 

N = Y,b i x i (1.5) 

i=0 

in which 6 is of the form 10 a with a an integer; the cost is a 'smeared' function of In TV 
because there is potentially a function for determining the integrality of a without 
testing all a from to ln(10, N) (and a = —00). The solution for the ctj follows from 
an LU (or QR) decomposition, which is of the order In 3 iV steps. 

Exceptional Case: 

In the case that all cti are integral, with a number iV factoring into n prime 
numbers, then 

OiX + 1 = Ni , (1.6) 
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with all Ni prime numbers. This includes the case of two prime numbers iV = pq. 
In this case 7 itself must be a prime number. However, this case is quite special as 
the factorization in (jl.lj) must result in all integral values aj. The result are a very 
special set of numbers, 

7 = 60 , 

7(aia 2 + ai« 3 + ...) = b x 
7(aia 2 «3 + perms) = b 2 

(1.7) 

in which all oti are such that ctiX + 1 is prime, with a, integer, and also with 7 prime. 
These numbers, and the factorization, should be interesting to construct. Of course, 
the 10 a test of bo should rule out these numbers generically. 

Cubic and Quartic Cases 

The other two cases required to complete the analysis of the two prime numbers 
composing a number N are the cubic and quartic examples, as analyzed in [2|. These 
cases span the forms, 

N = -y'[[(a i x + l) , (1-8) 

with aij in complex pairs and the pairwise product generating the prime factors. In 
the case of a cubic there is a single pair, and in the case of the quartic there are two 
pairs. The 7 is bo, which is 10* or a prime number. 

Comments on Base Determination 

Empirical observation, with the use of a Matlab program, indicate that the fac- 
torization of a number N into the quadratic case 

N = ax 2 + bx + 10* , (1.9) 

can be performed with an approximate In iV solutions. That is, given the remainder 
term 10*, there appears to be at most IniV solutions to the the base x\ this bound 
seems to be even lower in some examples. 
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Knowledge of the totient function 4>(N), and its inverse, might be responsible for 
this delimiting factor (the o~k{N) which are generally determined in terms of <p(N)). 
The totient has been bounded in general by 



<f>(N)> 



N 



(1.10) 



In In AT + 
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In In AT 



For prime N, the totient hits a local maximum, (j)(N) = N—l. The function oscillates 
strongly near these local maximum, being partly due to the presence of small prime 
factors such as 2 or 3 which make the factor 1 — l/pj small. However, for numbers 
N which are between these prime N's, or local maximum, the oscillations are much 
smaller (e.g. of strength In AT) and the function is relatively stable. A reasonable 
initial value for a numerical routine is chosen for 4>(N) and bf, the parameters can 
be computed from the system of equations relating 4>(N) and its moments o~k(N). 
Apparently, the case of a number A" factoring into two numbers p and q which are of 
the same size in digits is easier to factor in this approach than with two numbers of 
different sizes. 

k ■ 

Case of N = ]J p/ : Multiple Factors 

The case of multiple prime numbers in the expansion of a number, such as A^ = 

k 

Yip/ , is examined by a generalization of the two prime factor case. According as in 
[2], the polynomials grow in size in the base expansion, 



with m bounded from Cn to 2Cn- The Cn is Including the required divisor 

functions o~i(N) allows the systems of equations to be solved for. The divisors can be 
written in terms of the totient function 4>(N), for classes of numbers with Cn fixed. 

Solving the systems of equations for foj, x and 4>(N) generates the prime numbers 
Pj and coefficients kj, at fixed Cn- The number of possibilities increases, however, 
because there are from Cat to 2Cn equations in the individual system sets. There 
are also additional branch cuts; the use of the branch cuts may generate as check on 
the actual unique solution to these parameters. The solution to the prime factors is 
again achieved in 0(1) operations, set by the number of prime factors. 

The cases of b = 10* or 6 prime are handled individually by the direct solution 
of the systems of equations for bi, x. 



in 



(1.11) 
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Discussion 

Previous work in [2j generated an algorithm for determining the prime factors of 
a number N, using base expansions with the base determination the limiting factor 
on the complexity. 

The factoring of numbers can be achieved in polynomial time, albeit at an order 
which is unexpected in the literature. The number of remainder terms in the base 
expansion, the 10* , is generically In N in number; the number 10* could be replaced by 
a prime number, which is a special case. The further inverse determination of the base 
x following from each case, although the algorithem is apparently unknown, appears 
to number in at most In N cases; this results in the complexity of factoring a number 
to be in In 2 N compuational steps. An algorithm for the base determination from the 
remainder term could in principle be performed in 0(1) steps, even if a table is known 
(this table is similar to a table of pythagorean triples, upon further manipulation of 
the base determination with the roots being the rational numbers). 

In the case of a number containing CV prime factors, solving a system of coupled 
algebraic equations of degree Cat to 2CV is required [2j, with remainder term in the 
base expansion of the form 10 l or a prime number. These polynomials are of degree 
2 to 4 in the case of numbers N = pq, i.e. two prime factors. 

These equations are Diophantine in the sense that the solutions to the parameters 
are integral. Solving these equations requires the techniques for analyzing low degree 
polynomials, potentially beyond the quintic. Numerical methods are straightforward 
to implement. 

The analysis could also generate an indirect determination of the totient function 
<ft, and the related divisor functions cr^, given an algorithm for the base determination. 



5 



References 

[1] R. Crandall and C. Pomerance, Prime Numbers, A Computational Perspective, 
Springer- Verlag Inc., (2001). 

[2] Gordon Chalmers, Fast Factoring of Integers, physics/0503159, 



6 



